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Abstract 

In designing a network to link n points in a square of area n, one 
might be guided by the following two desiderata. First, the total net- 
work length should not be much greater than the length of the shortest 
network connecting all points. Second, the average route length (taken 
over source-destination pairs) should not be much greater than the average 
straight-line distance. How small can we make these two excesses? Speak- 
ing loosely, for a non-degenerate configuration the total network length 
must be at least of order n and the average straight-line distance must 
be at least of order n 1 / 2 , so it seems implausible that a single network 
might exist in which the excess over the first minimum is o(n) and the 
excess over the second minimum is o(n 1/2 ). But in fact one can do better: 
for an arbitrary configuration one can construct a network where the first 
excess is o(n) and the second excess is almost as small as O(logn). The 
construction is conceptually simple and uses stochastic methods: over the 
minimum-length connected network (Steiner tree) superimpose a sparse 
stationary and isotropic Poisson line process. Together with a few addi- 
tions (required for technical reasons), the mean values of the excess for 
the resulting random network satisfy the above asymptotics; hence a stan- 
dard application of the probabilistic method guarantees the existence of 
deterministic networks as required (speaking constructively, such networks 
can be constructed using simple rejection sampling). The key ingredient 
is a new result about the Poisson line process. Consider two points at 
distance r apart, and delete from the line process all lines which sepa- 
rate these two points. The resulting pattern of lines partitions the plane 
into cells; the cell containing the two points has mean boundary length 
~ 2r + constant x log r. Turning to lower bounds, consider a sequence of 
networks in [0, \/n\ 2 satisfying a weak equidistribution assumption. We 
show that if the first excess is 0(n) then the second excess cannot be 
o(Vlogn). 
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1 Introduction 

We start with a counter-intuitive observation and its motivation, which prompted 
us to probe more deeply into the underlying question. 

Consider n points ("cities", say) in a square of area n. Using the terminology 
of computer science, we are interested in both the worst-case setting where 
the points are located arbitrarily in the square, and the average case setting 
where the points are random, independent and uniformly distributed. Consider 
a connected network (a road network, say), made up of a finite number of 
straight line segments and linking these n points and perhaps other junction 
points. Recall that the minimum length connected network on a configuration 
of points x™ = {xi, . . . , x n } is the Steiner tree ST(x"). 

It is well known and straightforward to prove [9l [11] that in both the worst 
case and the average case the (mean) total network length len(ST(x™)) grows as 
order 0(n). When designing a network, it is reasonable to regard total network 
length as a "cost" . A natural corresponding "benefit" would be the existence (in 
some average sense) of short routes between points. Let £(xi,Xj) be the route- 
length (length of shortest path) between points Xi and Xj in a given network, 
and let \xi — Xj\ denote Euclidean distance (so £(xi,Xj) > \x{ — Xj\). A good 
network should satisfy the following informal criterion: 

The short routes property: Averaging over pairs chosen 
uniformly at random, the route- length £(xi,Xj) between points x% 
and Xj is not much larger than the Euclidean distance \xi — Xj\. 

A first take on a statistic to measure this property for a connected network 
G(x n ) is the ratio statistic, based on averaging the ratios of network route- 
lengths versus Euclidean distances. Consider a network G(x") to be the config- 
uration of points x" = {xi, . . . , x n } together with a collection of line segments 
which combine to connect every Xi to every other Xj . 

Definition 1 (Ratio statistic). Let average^) denote the average over all dis- 
tinct pairs Then 

ratio(G(x")) = average - 1 > 0. (1) 

Consider a network G(x n ) based on n uniform random points x™ c [0, \fn] 2 , 
having (say) twice the total length of the Steiner tree. Initially we speculated 
that in this case the expectation E [ratio(G(x™))] would at best converge to some 
strictly positive constant as n — » oo. However this intuition is wrong: 



2 



Counterintuitive observation (see section I5.3|) . It is possible 
to construct networks over well-dispersed configurations whose to- 
tal lengths are greater than the corresponding Steiner tree lengths 
by only an asymptotically negligible factor, but for which the ra- 
tio statistic converges to zero as total network length converges to 
infinity. 

These considerations were originally motivated by analysis of real-world net- 
works. Consider for example the "core" part of the UK rail network; that part 
which links the 40 largest cities. Given a statistic R designed to capture the 
"short routes" property, one can then consider how closely the observed value of 
R approaches optimality. Of course the real network has evolved according to a 
complex historical process heavily influenced by topography; nevertheless it is 
of interest to consider whether its value of R is close to the minimum possible 
value of R taken over all possible networks connecting the 40 cities but of no 
greater total length. 

One is then led to ask what statistic R might best capture the imprecisely ex- 
pressed "short routes" property, and our consideration of n cities in an idealised 
square [Oj-^/n] 2 is designed to illuminate this question. The above counterin- 
tuitive observation can be interpreted as implying that the ratio statistic of 
Definition Q] is probably not a good choice of statistic, because we prove this 
observation by constructing networks which are approximately optimal by this 
criterion and yet are plainly rather different from many plausible real-world 
networks. What is a good choice of statistic will be discussed in a companion 
paper, along with some real-world examples. 

Informally, the counter-intuitive observation suggests that we can construct 
networks for configurations of n points which have total network length ex- 
ceeding that of the Steiner tree by just o(n), and such that the average excess 
of network distance over Euclidean distance is o(n 1 ^ 2 ) (bearing in mind that 
average Euclidean distance for "evenly spread out" configurations should be 
0{n 1 / 2 )). In fact much more is true: whatever the configuration of n points in 
[0, ^/n\ 2 (hence, even in "worst case" scenarios) we can construct such networks 
with average excess of network distance over Euclidean distance barely more 
than O(logn). This we can work on an additive rather than a multiplicative 
scale: 

Definition 2 (Excess average length for a network). The excess route length 
for a network G(x") is 

excess (G(x™)) = average (£(xi 7 xj) — \xi — Xj\) . (2) 

Theorem 1 (Upper bound on minimum excess network length) . For each n let 
x n be an arbitrary configuration of n points in a square of area n. The following 
asymptotics hold for large n: 

(a) Let w n — > oo. There exist networks G{x n ) connecting up the points such 
that 
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(i) \en(G(x n )) - len(ST(x n )) = o{n); 

(ii) excess(G(a:™)) = o(w n \ogn). 

(b) Let e > 0. There exist networks G{x n ) connecting up the points such that 

(i) len(G(a:™)) - lcn(ST(a; rl )) < en; 

(ii) excess(G(x n )) = O(logn). 

This result is proved in Sections and [3J The idea is to build a hierarchical 
network. Details are given at the start of Section [3J but here is a sketch. At 
small scales routes use the underlying Steiner tree. At large scales, routes use 
a sparse collection of randomly oriented lines (a realisation of a stationary and 
isotropic Poisson line process); this is the key ingredient that permits an excess 
of at most o(w n log(n)), respectively 0(log(n)) (Section [5]). We believe that 
only these two scales are needed, but to simplify matters (so as to avoid non- 
elementary analysis of Steiner trees and geodesies in Poisson line networks) we 
introduce an intermediate scale consisting of a widely-spaced grid. Thus a route 
from an originating city navigates through the Steiner tree to a grid line and 
then along the grid line to a line of the Poisson line process, and then navigates 
in the reverse sense down to the destination city. (For technical reasons the 
discussion in Section [3J also introduces occasional small rectangles to permit 
circumnavigation around Steiner tree "hot-spots"). The key ingredient in the 
analysis is a calculation concerning the Poisson line process, which has separate 
interest as a result in stochastic geometry (Theorem [4] below) . Consider two 
points at distance r apart, and delete all lines from the line process which 
separate these two points. The resulting pattern of lines partitions the plane 
into cells; the cell containing the two points has mean boundary length which 
for large r is asymptotic to 2r + constant x logr. 

Note that randomness arises only through use of the Poisson line process to 
supply a relatively small number of long straight connections; the point pattern 
x™ is arbitrary. The probabilistic method may now be used to prove the exis- 
tence of a non-random networks satisfying the asymptotics described in Theorem 
[TJ based on applying Markov's inequality to the expectations E [len(G(x n ))] — 
len(ST(x")) = o(n), et cetera. 

For lower bounds it is necessary to impose some condition on the empirical 
distribution of the points in x™, since if all the points concentrate on a line 
then the excess is zero! We need a quantitative condition on equidistribution of 
points over a region, formalised via the following truncated Vasershtein coupling 
scheme. 

Definition 3 (Quantitative equidistribution condition). Let x n for varying n 
form a sequence of configurations in the plane, let /x™ be a probability measure 
on the plane, and and let L n > 0. Say x n is L n -equidistributed as fi n if there 
exists a coupling of random variables (X n ,Y n ) such that 

(a) X n has uniform distribution on the finite point-set x ra , 

(b) Y n has distribution /i", 
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(c) E min(l, 



clS ft — > oo . 



A sufficient condition for the following result is that x™ is L„-equidistributcd 
as the uniform distribution on the square of area n, for some L n — o(^/logn). 
The purpose of introducing the non-uniform distribution pi 1 in Definition [3] is 
to permit us to express Theorem [2] below in terms of weaker and more local 
conditions: for example a consequence of Theorem [2jb) is that we may replace 
the uniform reference distribution by any distribution p on [0, l] 2 with a con- 
tinuous density component, rescaled to produce a distribution p n on [Q,n 1 / 2 ] 2 . 
In particular the geometry of [CI,™ 1 / 2 ] 2 plays no role in this result. 

We choose to express Definition [3] in stochastic terms purely for convenience 
of exposition. For example, arguments using the connection of total variation 
to coupling show that x" is L„-equidistributed as the uniform distribution on 
[o, y/n] 2 if the following non-stochastic condition is satisfied: for some sequence 
of numbers A n — > oo with X n /L n — > and n/A 2 being integral, 
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Tlbox)-A 2 | -> 0, 



with the sum being taken over n/A 2 boxes partitioning [o, \fn\ 2 into cells of 
sidelength A„ . Thus a wide range of possible point patterns can be seen to be 
L n -equidistributed in the above sense. 

Theorem 2 (Lower bound on minimum excess network length). Let £C™ be a 

configuration of n points in a square [0, \/n\ 2 . Let L n — o(y/logn). Suppose 
either 

(a) x n is L n -equidistributed as the uniform distribution on the square of area 
n : 

or (more generally) 

(b) for some fixed p and e, there is a subcollection j/ '™) of k(n) points, all 
lying in a disk D n of area irpn, such that k{n) > irpne, and such that 
y k ( n ^ is L n -equidistributed as the uniform distribution on D n . 

Let G(x n ) be a network based on the full collection of n points. Iflen(G(x n ))/n 
remains bounded as n — > oo, then 



excess(G(a;")) = n(v%gn) • (3) 



(Thus, liminfn^oo excess ( G ( x n ) ) / Vlog n > 0.) 

Configurations x™ produced by independent uniform sampling from [0, \/ri] 2 
satisfy the conditions of this theorem (see Remark [5]) , but so will many other 
configurations exhibiting both clustering and repulsion. The proof of the the- 
orem is given in Section [U and exploits a tension between the two following 
facts: 
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(a) An efficient route between Xi and Xj must run approximately parallel to 
the Euclidean geodesic, and hence will tend to make almost orthogonal 
intersections with random segments perpendicular to this geodesic. 

(b) On the other hand, the equidistribution condition means that two points 
Xi and Xj randomly chosen from the subcollection must be nearly inde- 
pendent uniform draws from D n , which permits the derivation of upper 
bounds on the probability of nearly orthogonal intersections of the form 
given in fact (a). 

Finally, one might hope to improve the result by imposing a more restrictive 
assumption than the requirement that len(G(x n )/n remains bounded as n — > oo. 
This requirement is weaker than either of our two alternative assumptions on 
len(G(x"))-len(ST(x n )) in the upper bound (since len(ST(x n )) = 0[n)). How- 
ever we are unable to improve ([3]) under either of the two stronger assumptions. 

2 The Poisson line process network 

Our upper bound on minimal excess (G(x™)) is based on a result from stochastic 
geometry (Theorem U below) which is of independent interest. 

Recall that a Poisson line process in the plane R 2 is constructed as a Poisson 
point process whose points lie in the space which parametrises the set of lines 
in the plane. We will consider only undirected lines, which will be parametrised 
by (r, 6) GM.X [0, ir) where r is the signed distance from the line to a reference 
point and 9 is the angle the line makes with a reference axis. A stationary and 
isotropic Poisson line process has intensity measure invariant under rotations 
and translations of M 2 : a stationary and isotropic Poisson line process II of unit 
intensity is one for which the number of lines of II hitting a unit segment has 
expectation 1 (further facts about Poisson line processes may be found in [To], 
Chapter 8]). We are interested in the cell containing two fixed points which is 
formed by the lines of II that do not separate the two points, because this can 
be used as the efficient long-distance part of a network route between the two 
points (see Lemma [3T3f . Theorem [4] establishes an asymptotic upper bound for 
the length of the mean cell perimeter in case of wide separation between the 
two points; we prepare for this by using a Buffon argument to derive an exact 
double-integral expression for the mean cell perimeter length: 

Theorem 3 (Mean perimeter length). Let H be a stationary and isotropic 
Poisson line process of unit intensity. Fix two points Vi, Vj which are distance m 
apart. Delete the lines of II which separate the two points Vi, Vj. The remaining 
line pattern partitions the plane: the cell C{vi, Vj) containing the two fixed points 
has mean perimeter W,[lendC(vi,v j)] — 2m+ J m , where J m is given by the double 
integral 
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J m = E [len dC(vi, Vj)]— 2m — — jj (0 — sin <f>) exp (— ^(77 — to)) da; . 



(4) 



Here 77 = rj{x) is a sum of distances \v% — x\ + \vj — x\, while <fi = <fi(x) is the 
exterior angle at x of the triangle with vertices x, Vi, Vj (see Figure\^. 




Figure 1: Definition of r\ and 4>. Note that 4> is the sum of the two interior 
angles tp and 9. 



Proof. This proof can be phrased in terms of measure-theoretic stochastic geom- 
etry, using the language of Palm distributions and Campbell measure. Since we 
deal only with constructions based on Poisson processes, we are able to adopt a 
less formal but more transparent exposition, for the sake of a wider readership. 

Let s be the line segment of length to with end-points Vi, Vj. The idea of 
the proof is to measure E [len dC(vi, Vj)] by computing the expected number of 
hits on dC{vi,Vj) made by an independent homogeneous isotropic Poisson line 
process II, again of unit intensity. Each hit corresponds to one of the points in 
the intersection point process X — {i(£,£) : £ £ H,£ € II}, where 

l{£J) = l x i"n ?={*}, 

1 undefined if £, £ are parallel. 
Note that with probability 1 the intersection point l(£, £) is defined for all £ £ II, 

££ n. 

Not all intersection points x £ X correspond to hits on dC{vi,Vj). The 
condition for x = l(£, £) £ X to represent a hit on dC(vi 7 Vj) is that £ should not 
hit s (for otherwise it cannot be involved in the construction of dC(vi, Vj)) and 
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that x is not separated from s by any line from II \ {£}. Recall that the Slivynak 
theorem [TUl §4.4, example 4.3] implies that H\ {£} conditional on £ £ II is itself 
a homogeneous isotropic unit-rate Poisson line process. Consequently, under the 
condition that £ does not hit s, the conditional probability of x = l(£,£) £ X 
representing a hit on dC(vi,Vj) is equal to the probability p(x) of there being 
no line in II which cuts both the segment from Vi to x and the segment from Vj 
to x. 

A classic counting argument from stochastic geometry then reveals that 

p(x) = exp - x\ + \vj - x\ - m)) = exp (- 5 (r/ - m)) . (6) 

Accordingly, if v is the intensity of the point process X then we may compute 
the mean number of hits on dC(vi, Vj) as 



Here "£ $ s" stands for "the line £ does not hit s" - noting that the conditioning 
in this context forces the Poisson line £ to pass through x but does not fix its 
orientation - and on the right-hand side the summand 2m corresponds to the 
fact that hits of II on s count as automatic double hits on dC(vi, Vj). 

Condition on x = l(£, £) £ X (which is to say, condition on there being 
Poisson lines £ £ II, £ £ II both passing through x) and consider 

(a) the angle £1 made by £ with the line through m and x; 

(b) the angle £2 between I and I. 

By isotropy of II the random angle £1 is Uniform(0, 7r). Conditional on £1 
and more generally on II with an I £ II passing through x, the intersection 
of II with £ is a Poisson point process on £ of unit intensity. Moreover if the 
intersection points are marked with angles of intersection £2 then the mark £2 
has mark density ^sin^ over £2 S [0, 7r) (consider the length of the silhouette 
of a portion of£ viewed at angle £2)- Hence the conditional distribution of £2 for 
x = i{£,£) has density isin£ 2 over £ 2 € [0,7r), and so we can compute (working 
with £2 modulo 7r) 



where (f> = 9 + ip is the exterior angle at x of the triangle formed by x, i>i, Vj 
(see Figure [T]). 




P £Ks,£j(s\x = l(£,£) 
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Finally the intensity v of X can be computed as ? , for example by computing 
the mean number of hits of the unit disk by II, then by computing the average 
length of the intersection of the disk with a line of II conditional on that line 
hitting the disk. Thus 

J m = E [len(5C (v, ,Vj))}- 2m 

?\£$s,Itfs\x = o(e,I) € x\ exp (-±(77-771)) dx 



] exp (— ±(77 — to)) dx (9) 



as required. □ 

We now state and prove the main result of this section: an O (log to) upper 
bound on the mean perimeter excess length J m . 

Theorem 4 (Asymptotic upper bound on mean perimeter length). The mean 
perimeter excess length J m is subject to the following asymptotic upper bound: 

Jm < O(logm) as to — ► 00 . (10) 

Proof. Without loss of generality, place the points Vi and Vj at (—^,0) and 
(tt, 0). The double integral in (|4]) possesses mirror symmetry about each of the 
two axes, so we can write 

J m — 2 J J ((f> — sin^) exp (— |(r? — rrij) dx 

[0 : oo) 2 

/•tt/2 f ' f j sec 9 

= 2 / (cj> — sin0) exp (— ^(77 — to)) rdr d9+ 

Jo Jo 

+ 2 1 I ((f)- sin0)exp (—^(77 - to)) rdrdO (11) 

Jir/2 JQ 

(using polar coordinates (r, 6) about the second point Vj located at (^,0)). 
The integrand in the second summand is dominated by 7rexp(— 5) which 
is integrable over (r, 8) E (0, 00) x (§,7r). (In this region geometry shows that 
77 — to > r(l — cos9) > r.) Thus we can apply Lebesgue's dominated convergence 
theorem to deduce that the second summand is 0(1) as to — > 00, hence may be 
neglected. 

In fact we can also show that part of the first summand generates an 0(1) 
term: the dominated convergence theorem can be applied for any e £ (0,7r/2] 
to show that 

/>tt/2 sec 9 

2 / ((f> — sin (f>) exp (-i(?7-m))rdrd0 = 0(1) , 

since the integrand is dominated by it exp (— ^(1 — cos 0)) r over the region 
(r, ff) G (0, 00) x (e, -|) (in this region geometry shows that 77 — to > r(l — cos^) > 
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r(l — cose)). Thus for fixed e £ (0, ^) as m — ► oo we have the asymptotic ex- 
pression 

/•£ sec 9 

J m = 2 (0-sm0)e3tp(-§(?7-m))rdrd0 + O(l), 

Jo Jo 

where the implicit constant of the O(l) term depends on the choice ofe > 0. 

Now in the region where < 9 < e and < r < ~ sec 9 we know that 
4> < 29 < 2s, and moreover </> — sin tf> is an increasing function of <j>. Therefore 
there is a constant C E , converging to zero as e — * 0, such that in this region 

(£-sm<£ < 20-sin(20) < i^l! < C £ 1 ~ sinfl . 

Hence (as m — * oo for fixed e > 0) 

sin0) exp (— ^(rj — m)) rdrd0 

- sec 



JO 



/•e p sec p 

< |C e / / (l-cos(9)sin(9exp(-§(l-cos(9))rdrd(9 
Jo Jo 



|C S / ( / se~ s ds ) 7"" 7^ (using s = §(1 - cos0)) 



n 



< |C S / / se-Ms — — (using v = f (sec0 - 1)) 



.^(Bacfl-l) _ \ sinfldfl 

1 — COS 

' ] i /"' , \ 1 dz; 

1 + 4w/m w 

< |C e log(f (sece-l))+0(l). 

□ 

Remark 1. More careful analysis yields useful o(l)-asymptotics: in fact it can 
be shown that as m — > oo so 

J m = |(logm + 7+|)+o(l), (12) 

where 7 is the Euler-Mascheroni constant: 7 = lim m ^oo ((J)^ r) — logm). 
These o(l)-asymptotics show very good agreement with simulation: see for ex- 
ample the simulation reported in the legend of Figure [2] 



3 A low-cost network with short routes 

In this section we prove Theorem [1] for a given configuration x" C [0, \/n\ 2 
we construct networks G(x n ) for which both len(G(x™)) — len(ST(x ra )) and 
excess(G(x") are small. The network is constructed by augmenting the Stcincr 
tree network ST(x") in a hierarchical manner. The construction is stochastic: 
we construct a random augmentation for which the mean values of these excess 
values obey the desired asymptotics and then apply the probabilistic method 
to establish existence of the desired non-stochastic networks. Working from the 
largest scale downwards, we construct 
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Figure 2: Simulation of semi-perimeters for 1000 independent cells for unit- 
rate Poisson line process, with points located at distance 10 8 units apart. The 
figure is subject to vertical exaggeration: y-axis is scaled at 10 4 times a;-axis. 
Empirical mean excess semi-perimeter is 27.63 with standard error ±0.28, versus 
predicted mean excess semi-perimeter 27.5528 (using o(l)-asymptotics). 

1. a stationary and isotropic Poisson line process H of intensity n, where 
r\ will be small: note that this can be constructed from a unit intensity 
process by scaling by a magnification factor of 1/rj. A simple computation 
[TUl Section 8.4] shows that the mean total length of the intersection of 
the resulting line pattern with [0, \/n\ 2 equals irrjn. 

2. A medium-scale rectangular grid with cell side-length s n ~ (logn) 1 / 3 . To- 
tal length of this grid in [0, y/n\ 2 is bounded above by 

2(1 + ^)V^ = o(n). 

3. The Steiner tree ST(af). 

4. A small number (at most n/2) of small hot-spot cells based on a small- 
scale rectangular grid with cell side- length t n ~ j — \i/e ■ A cell in this 
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Figure 3: Illustration of construction of network to deliver an upper bound on 
mean excess route-length. Points are indicated by small circles. In this figure 
there is just one hot-spot cell. 



grid is described as a hot-spot cell if it contains two or more points. These 
hot-spot cells are used to by-pass regions where the Steiner tree might 
become complicated and expensive in terms of network traversal. We add 
further small segments connecting each hot-spot cell perimeter to points 
within the hot-spot cell. Total length of these additions can be bounded 

by ^ t 

Thus the mean excess length of this augmented network is o(n) + nTjn. The 
construction is illustrated in Figure [3l Note that we can choose s n and t n 
such that n 1 / 2 / s n and s n /t n are integers, so that the small-scale lattice is a 
refinement of the medium-scale lattice, which itself refines the square [0, \/n\ 2 . 

3.1 Worst-case results for Steiner trees 

We first record two elementary results on Steiner trees. The first result bounds 
the length of a Steiner tree in terms of the square-root of the number of points 
(for the planar case). 

Lemma 3.1. Consider a configuration x k of k points in a square of side r: 
there is a constant C\ not depending on k or r such that 

len(ST(a; fc )) < dVkr . (13) 

Proof. See [SJ Section 2.2]. □ 

The second result provides a local bound on length contributed by a larger 
Steiner tree in a small square containing a fixed number of points. 
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Lemma 3.2. Consider the Steiner tree ST (af) for an arbitrary configuration 
x n in the plane. Let G be the restriction of the network ST (x n ) to a fixed open 
square of side-length t. Suppose k points x\, . . . , Xk of the configuration x n lie 
within the square. Then 

len(G) < tU + CxVk + l) ■ (14) 

Proof. Let yi, . . . , y m be the locations at which ST (x™) crosses into the interior 
of the square. (Note: m — is possible if {xi, . . . , Xk} = x": in this case choose 
yi arbitrarily from the perimeter of the square.) Then 

by minimality of ST (x n ), 
using square perimeter, 

using the previous lemma. 

□ 



len(G) < len(ST({xi,...,a; fe , yi,..., y m })) 

< len(ST({x 1 ,...,x k ,y 1 }))+4t 

< tU + dVk + l] 



3.2 Route-lengths in the medium-large network 

The part of the construction involving the medium-scale grid and the Poisson 
line process is useful in variant problems, so we separate out the following esti- 
mate involving these ingredients. 

Lemma 3.3. Let n 1 / 2 / s n be an integer. Consider the superposition of the 
rectangular grid with cell side-length s n and the Poisson line process of intensity 
T), intersected with the square [O,™ 1 / 2 ] 2 . Let v l , Vj be vertices of the grid. Then 

E [route-length Vi to Vj] < \v{ — Vj\ + C*2^ log(?7V / 2n) 
for an absolute constant C^. 

Proof. Let C(vi,Vj) be the cell of n containing Vi and Vj (having deleted lines 
from II which separate Vi from Vj). Let R(vi,Vj) be the rectangle bounded by 
Vi and Vj ; then by convexity the route- length from Vi to Vj is bounded above by 

-len9(i?(t; i ,^)nC(ui ) v,-)) < -z lendC(vi, vj) , 

whose mean value can be computed by recognising that the Poisson line process 
is a rescaled version of a homogeneous isotropic unit rate Poisson line process. 
Hence by scaling the asymptotic upper bound of Theorem 0] we have 



E 



■ len d {R{vi, Vj) n C(uj, Vj)) 



1 

2 

< oQlog^l^-^l)") = O log (pVtoty 



□ 
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3.3 Navigating the augmented network 

We now explain how to move from points of x" up to a vertex of the medium- 
scale grid. 

Given Xi <E x™, if this is in one of the hot-spot cells then move to the 
perimeter of the hot-spot cell and thence to a suitable point of departure on 
the perimeter, with route-length at most %t n . Now move along the Steiner 
tree within the relevant medium-scale grid box to the box perimeter; however 
by-pass all hot-spot cells. There are (s n /t n ) 2 = ((logn) 1 / 3 (logn) 1 / 6 ) = logn 
small squares each of which involves a route-length of either 2t n (if a hot-spot 
box which will be by-passed) or t n (4 + Ciy/2) (if not, by Lemma f3.2[) . Hence 
the total trip to the medium-scale grid box perimeter (including emergence from 
the initial hot-spot, if required) has length at most 

|<n + t„(4 + CiV2)x4/4 ^ §£ n + (4 + CiV2)x(logn) 5 / 6 = o(logrc). 

Furthermore the route length from perimeter to vertex of medium-scale grid box 
is at most \s n ~ ^(logn) 1 / 3 = o(logn) . So for each Xi there is a medium-scale 
grid vertex v% for which route-length from x% to Vi is o(log n) . Combining with 
Lemma 13.31 and noting that the medium-scale grid geometry forces \v% — Vj\ < 
\xi — Xj \ + 2^7=, we find 

E [route-length from Xi to Xj] — \xi — Xj\ < V2s n + o(logn) + 6*2^ log (^q^/2n^ . 

Averaging over the points of x", it follows that the dominant contribution comes 
from the cell semi-perimeters, and indeed 

E [cxcess(G(x"))] < O (± log (pV^i) ) , 

at a cost in terms of network length which exceeds len(ST(x™)) by a stochastic 
quantity of mean 777771 + o(n) . 

The two different results of Theorem [1] follow by choosing 77 to behave in two 
different ways: 

(a) either rj — > 0, r)w n — * 00, 

(b) or T] = e > 0. 

In either case we can apply the probabilistic method to exhibit existence of 
the required deterministic networks for cases (a) and (b) of Theorem [TJ For 
example in case (a) it is then the case that E [len(G(x")) — len(ST(x™))] < nc n 
and E [excess(G(x™))] < c n w n \ogn for some c„ — > 0. But then for any fixed 
n we can apply Markov's inequality: P[len(G(x™)) — len(ST(x")) > 3nc„] < -| 
and P [excess (G(x™)) > 3c n w n log n] < |. Hence there is positive probability 
that the random network satisfies both len(G(x™)) — len(ST(x")) < 3nc n and 
excess(G(x™)) < 3c n w n logn, hence such a network exists for each n. 

We can view these applications of Markov's inequality as indicating a simple 
rejection sampling algorithm to be used to generate the required sequence of 
networks. 
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4 A lower bound on average excess route-length 



In this section we prove Theorem [2] The proof is divided into four parts. Firstly 
(Subsection 14. ip we show how to reduce the problem to an analogous case in 
which the excess is computed for two random points drawn independently and 
uniformly from the whole disk D n of area irpn given in condition (b) of the 
theorem. Then (Subsection 14. 2[) we show that the network geodesic must run 
almost parallel to the Euclidean geodesic if the excess is small. On the other 
hand (Subsection 14. 3[) we can use the uniformity of the two random points to 
control the extent to which network segments can run both close to and nearly 
parallel to the Euclidean geodesic. Finally (Subsection l4.4[) we use the opposing 
estimates of Subsections 14.21 and 14.31 to derive a proof of the theorem using the 
method of contradiction. 



4.1 Reduction to case of a pair of uniformly random points 

First we indicate how condition (a) of Theorem [2] implies condition (b) . Un- 
der condition (a) we can use the coupling between X n and Y n to show that 
#{x"nD n }/ii — * irp: therefore for large n the number of points in x™ f)D n 
is approximately irpn. On the other hand the same coupling can be used to 
bound the total variation distance between the two conditional distributions 
C{Y n \X n 6 D n ) and £ (Y n \Y n e D n ) = Uniform(iD r j), and to show that this 
bound tends to zero. We can then use rejection sampling techniques to couple 
C(Y n \X n 6 D n ) and Uniform(D„) so that the truncated Vasershtein distance 
tends to zero as n — > oo; as the distance is a metric we can combine this cou- 
pling with the (conditioned) coupling of C (X n \X n e D n ) and C (Y n \X n e D n ) 
to obtain a coupling which satisfies condition (6). 

We now note that it is sufficient to consider the analogous result for a con- 
figuration x™ of n points in the disk D n . For then we can apply the result to 
the lesser configuration y fc ( n ) (for k(n) as given in condition (b) of Theorem ^ 
and obtain 



excess(G(y fe(n) )) = Sl{^logk(n)) = n(^lognpne) = tl(y/logn) , 
while 

excess(G(y fc ("))) = "j" ~ 1} excess (G(x")) 
k(n){k(n) — 1) 

< -, — excess (G(x™)) , 

Trpe{Trpe — 1/raJ 

from which Theorem [3 follows. 

We therefore consider x" C D n being L„-equidistributed as the uniform 
distribution on D n . So by definition there is a coupling (X\, Y±) (here we omit 
dependence on n) where X\ has uniform distribution on x™, Y\ has uniform 
distribution on D n and 



E 



■ , , \X 1 -Y 1 \ 
mm I 1, 



as n — > oo. (15) 



15 



Write (X 2 , Y 2 ) for an independent copy of X\,Y\. In the definition of excess 
it makes no asymptotic difference if we allow j = i in average^ j\ , so we may 
take 

excess(G(x")) = E[£(X U X 2 ) - |Xi - X 2 \] ■ (16) 

Set 

A n = [\Yt -X x \< L n ] n [\Y 2 - X 2 \ < L n ] (17) 
so that by Markov's inequality 

F[A n ] > l-2A n . (18) 

Define g(Yi,Y 2 ) by supposing that Y{ is plumbed in to the network using a 
connection by a temporary line segment with endpoints Yi and Xj. A direct 
computation shows that on the event A n 

t(Yi, Y 2 ) — |Yi — Y 2 \ < 

{i{X u X 2 ) + \Xx - Y t \ + \X 2 -Y 2 \)- (|Xi - X 2 \ -\Xi-Yi\- \X 2 - Y 2 \) 

< e(X 1 ,X 2 )-\X l -X 2 \+4L n . 

Consequently 

E [£(Yx, Y 2 ) — \Yi — Y 2 \;A n ] < excess(G(x")) + AL n . (19) 

By hypothesis L n — o(^/logn), and so the proof of Theorem[2] reduces to showing 
that the left side (the excess for two random points chosen uniformly in the disk) 
is fi(v / logn). 

4.2 Near-parallelism for case of small excess 

We now substantiate our previous remark that the network geodesic must run 
almost parallel to the Euclidean geodesic if the excess is small. 

It is convenient to situate the disk D n in the complex plane C so as to have 
a compact notation for rotations. For t > we define Zt and $ by 

r*\ Y 2 -Y x 
exp (i$) = 



Y 2 -Y 1 \ ' 

Z t = Yi+t x exp (m) . (20) 

Let 7 : [0,£(Yi,Y2)] — * C be the unit-speed network geodesic running from 
Y\ to Y 2 (using the temporary plumbing to move from Y\ to X\ and then again 
from Y 2 to X 2 ). Then (bearing in mind that |7'(i)| = 1) 

1{Y X ,Y 2 ) = / | 7 '( S )|d S > / \ 7 '(r(t))\r'(t)dt, (21) 



where r(t) is the first time s at which ("f(s) — Fi, exp (i$)) = t. (Note that 
our networks are formed from finite collections of line segments. Hence r' will 
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Figure 4: Illustration of construction of Y\_, Y 2 , and Z t . The angles 0(t) and Si, 
62, ■ ■ ■ are computed using the angles of incidence of network segments on the 
perpendicular running through Z t \ T t . x is the minimum of absolute values of 
all such angles of points of intersection within yj 2t\ + X 2 °f %t ■ 

be defined and finite save perhaps at a finite number of times.) This and the 
following constructions are illustrated in Figured! 

Defining 6{t) by sec9(t) — r'(t), and using sec9 > 1 + \0 2 , we deduce 

1 r\Yx-Y 2 \ 

e(Yi,Y 2 ) > |y 1 -y 2 | + _/ 9(t) 2 dt. (22) 

* Jo 

Furthermore we can use Pythagoras and the geodesic property of Euclidean 
line segments to show the following. Let H(t) be the maximum |r| for which, 
for some s, 

j(s) — Z 4 + ir exp (i$) . 

If the excess for the network geodesic from Y\ to Y2 is bounded above by 
t&i, y 2 ) - |y - Y 2 \ < X then H(t) < y/2t X + X 2 - 

Let T t , x be the smallest \6\ such that some network segment intersects the 
perpendicular {Z t + ir exp i$ : r £ R} at angle ir/2 + S and at distance at most 
yj2tx + X 2 from Z t (thus 5 is the angle of incidence of this network segment on 
the perpendicular). If £(Yi, Y%) — \Y\ — Y 2 \ < x an d \Yi —Ys\ > ^y/pn, we can 
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use (|2"2j) to deduce 

£{Y 1 ,Y 2 )-\Y 1 -Y 2 \ > 

2 y o T L d *-2 (tJ x(I^i-^I+I^-F 2 |) . 

(The second summand allows for the temporary plumbing in of connections 
X\Y\ and X2Y2, for which the angle 9(t) € (0, ^) is not controlled by permanent 
network segments). So introduce the event 

B K . X = [£(Y 1 ,Y 2 )-\Y 1 -Y 2 \<x,\Y 1 -Y 2 \> K ^\ (23) 

and recall from Equation (jTTJ) the event A n = nf =1 [|li — Xi\ < L n ], Taking 
expectations, we deduce 

E[£(Y 1 ,Y 2 )-\Y 1 -Y 2 \ ; B K>X n A n ] 



-J E [?* x ■ B K , X n A„] di - — L„ . 
Using integration by parts to replace the expectation by a probability 



> 

" 2 J 



nt(YuY2)-\Y 1 -Y 2 \ ; B Ka nA n ] 



n 2 



4 

K.^ypn poo 



> 

\J~pn poo 



pK^/pn poo 

I P [[T tjX >u]n B K _ X n An] u dudt 

Jo Jo 

(P [B KiX nAJ-P [[T t , x <u]n B KtX n A n ])ududt 





> / / max (P [_B KjX nAJ-P [T t)X < u] , 0) it dudi . (24) 
Jo Jo 

Note also that from the definitions of B K ^ X and A n , using (fl"8)) . (fT9|) and Markov's 
inequality 

1 - P [B KjX n A n ] = 1 - P [A n ] + P [A n \ B K J 

, , excess(G(x™)) + 4L„ , , 

< 2A„ + P [\Yt -Y 2 \< K^/pn] + V V !> -. 25 

X 

To make progress we now need to find an upper bound for P [T t . x < u] and this 
is the subject of the next section. 

4.3 Upper bounds using uniform random variables 

Firstly we compute an upper bound on the joint density of the quantities Z t 
and $ from the previous section, illustrated in Figure [5] 
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Figure 5: Illustration of construction in Lemma |4. 11 



A,] V Z' d*d0, (26) 



Lemma 4.1. Suppose Y\, Y% are independent uniformly distributed random 
points in a disk D n of radius ^/pn and centre in the complex plane C. With 
Z t and $ defined as in (|20[) . the joint density of Z t and $ is given ewer Cx [0, 2tt) 
by 

{t + s{z,4>) f 
2ir 2 p 2 n 2 

where e^ — e 1 ^ is the unit vector making angle <fi with a reference x-axis, and 
s(z, 4>) is the distance from z to the disk boundary dD n in the direction c6 (thus 
in particular z + s(z, fye^ is on the disk boundary) . 

Proof. Express the joint density for Y\, Y 2 as a product of a uniform density 
over D n for Y x and polar coordinates r, (f> about Yi for Y 2 : 

d Vi tt r id, ^ ^ i rdrd(/) 



l[yi e £>„]— l[w + re^ e D n ] 



irpn irpn 



Obtain the result by integrating out the r variable and transforming the y\ 
variable to z by z = y\ + te 1 ^ . □ 
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Corollary 1. The density for Z t and $ (mod 7r) is 



( r At + S(z,(b)f , , (t + s(z,TT + (h)) 2 \ 

h[z-te^e D n ] i + I [z + ie € D n ] ± ^i. x 

, dz d<i> 

wi/i an upper bound 

f(z,tf>) < 4xI[0< 9 K,]^. (28) 

Proof. Equation (|27p follows immediately from adding the two expressions from 
Equation (|26[) for cj> (mod 7r). The upper bound follows by noting 

1. the maximum will occur when z — te^ runs along a diameter as t varies; 

2. furthermore when one of z ± ie^ lies on the disk boundary; 

3. and furthermore when z = is located at the centre of the disk (so t = 
s(z,±4>) = y/pn). 

□ 

Now consider the line segment §t,x centred at Z t , with end-points given 
by the pair ±iy/2t\ + X 2 ex P (*^)> an d consider the rose-of-directions empirical 
measure of angles made by intersections of network edges with this segment: 

"Rt x (A) = # { network intersections on § t x with angle of incidence lying in ^4} 

(here angles are measured modulo 7T, and A C [0, 7r)). We may apply a Buffon- 
type argument to bound E [ft tiX (A)] using Inequality (|2"gj) . Consider the contri- 
bution to the expectation from a fixed line segment of the network of length I: 
the result of disintegrating the integral expression for this according to the value 
of 4> is an integral of f(z, 4>) with respect to z over a region formed by intersecting 
the disk with a parallelogram of base side- length I and height 2\j2tx + X 2 sin a 
(here the angle a depends implicitly on <f> and z). Of course the integral vanishes 
if <f) ^ A. Thus Inequality ([28f yields a bound 

E[%J A )] < -4— x / / 2 V2tx + X 2 sinadadz. 
r/m Jg(x") J a 

For constant x, the event [T t , x < u] is the event [&t,x(§ — u, § + u) > 1] and so 

F[T t , x < u ] < E[3l t , x (f -u,f +«)] < J^ len (g(x")) v/2 ^ + ^ 2xM _ 

(30) 
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4.4 Calculations 

We have assembled the ingredients for the proof of Theorem [3J and so now can 
perform the calculations to get a quantitative lower bound. 

We proceed by contradiction. Suppose that excess (G(x™)) = o(y/logn). 
Inspecting (|23|) we see that we can choose X — Xn = o{^/\ogn) and some small 
k > such that for all sufficiently large n 

P[B K , x nA n ] > 2- 1 / 3 . (31) 

So we can combine (fl9l) and ([24]) (and the fact that 7r 2 /4 < 3) to get 

pK^/pn poo 

excess(G(x™)) + 7L n > J J max \2~ 1/s - P [T t , x < u] , OJ ududt. 
By (T5TJ]) and hypothesis of Theorem [2J there exists a constant B such that 



'[Tt, x <u] < V 12 V^X + X 2 x «. 



Applying the formula J °° max(0, a — /3u)udu = £p we see 
»e ss(G(x , ) + 7£ „ S i jf ^* - MLM. (32) 



Recall this holds under the assumption that x„ = o(vfogn) and that k > 
is constant. We are given that L n = o(yTogri), and we have supposed for the 
purposes of contradiction that excess(G(x")) = o(Vlogn). But then (f32|) takes 
the form 

ri(logn) 



o(viogn) > 



o(Vlogn) ' 



which is impossible. We deduce we must have excess (G(x")) = fi(yTogn). 



5 Closing remarks and supplements 
5.1 Spatial network design 

Within the realm of spatial network design, the closest work we know is that 
of Gastner and Newman [T], who consider the similar notion of a distribution 
network for transporting material from one central vertex to all other vertices. 
They give a simulation study (their Figure 2) of a certain algorithm on random 
points, and comment 

Thus, it appears to be possible to grow networks that cost only a 
little more than the [minimum-length] network, but which have far 
less circuitous routes. 

Our Theorem Q] provides a strong formalisation of this idea. 

An algorithm for minimizing excess for a given length is described in [8], 
where results for a 39 point configuration are shown. But neither this nor [1] 
has led to study of n — > 00 asymptotics. 
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5.2 Fractal structure of the Steiner tree on random points 



A longstanding topic of interest in statistical physics is that of the continuum 
limits of various discrete two-dimensional self-avoiding walks arising in proba- 
bility models, eg 

• uniform self-avoiding walks on the lattice, 

• paths within uniform spanning trees in the lattice, 

• paths within minimum spanning trees in the lattice. 

This study has recently been complemented by spectacular successes of rigorous 
theory [5]. It is conjectured that routes in Steiner trees on random points have 
similar fractal properties [7] : route- length between points at distance n should 
grow as n 7 for some 7 > 1. However, as our construction shows, such results 
have little relevance to spatial network design. 



5.3 The counterintuitive observation 



The counterintuitive observation following Definition [T] follows quickly from the 
work of Theorem Q] Suppose the configuration x" is well-dispersed, in the 
weak sense that for some 7 S (0, 1) we find the number of point pairs within 



n 1 / 2 of each other is o ((t 



(certainly this is the case for most patterns 



generated by uniform random sampling from [0,^/n] 2 ). Consider a network 
G(x n ) produced by augmenting the Steiner tree according to the construction in 
the proof of Theorem[T] Using the properties of this construction, the following 
can be shown 



E [ratio (G(x n ))] 



E 



average ■ 



(Xi , Xj ) 



1 



Fi — x j\ 

< constant x o(n 7_1 ) + (1 — o(n 7-1 )) 
1 log n N 



0(log V2n) 



< O max 



-7 ' n 7/2 



5.4 Derandomisation 

Theorem [T] is a purely deterministic assertion, though our proof used randomi- 
sation (supplied by the Poisson line process) . It seems intuitively plausible that 
one could give a purely deterministic proof, say by replacing the Poisson line 
process with a suitable sparse set of deterministically positioned lines having a 
dense set of orientations. 

5.5 Quantifying equidistribution 

The classical equidistribution property 
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the empirical distribution of {n 1 / 2 a;™, 1 < i < n} converges weakly 
to the uniform distribution on [0, l] 2 

is equivalent (by a straightforward argument) to the property 

x" is L„-equidistributed as the uniform distribution on the square 
of area n, for some L n — o(n 1 / 2 ). 

Replacing one sequence of L n by another slower-growing sequence makes equidis- 
tribution a stronger assumption, and so our assumption in Thcorcm[2ja) (equidis- 
tribution for some L n = o(log 1 ^ 2 n)) is stronger than the classical equidistri- 
bution property. Indeed Theorem [5] fails under the classical equidistribution 
property, as the following example shows. 

Example 5.1. Let L n = n 1 for some 7 € (fj^)- There exist networks G(x") 
which are Z/„-equidistributed as the uniform distribution on the square of area 
n, for which len(G(x™)) = o(n) whilst excess(G(x")) — * 0. 

For example: partition [Ojn 1 / 2 ] 2 into subsquares of side L„/logn, construct 
the complete graph on all centres of such subsquares, allocate the n points evenly 
amongst subsquares and position them arbitrarily close to the centres. 

As is apparent from the non-stochastic condition implying L„-equidistribution, 
there is a wide variety of configurations satisfying L„-equidistribution. Here we 
consider the particular case of independent uniform sampling, and show that 
this generates an L„-equidistributed sequence of configurations. 

Remark 2. Sample the configuration x n independently and uniformly from 
[0, \/n\ 2 - Let L n —>■ 00, perhaps arbitrarily slowly. Then the probability that the 
configuration x™ is L„-equidistributed with the uniform distribution converges 
to 1. This follows by dividing [0, ^n] 2 into cells of side-length asymptotic to 
L n /^/2, by conditioning on x", and by "blurring" the points of x™ by replacing 
each point x G x" by an independent draw taken uniformly from the cell con- 
taining x. Then a uniform random draw Y n of one of the blurred points can be 
coupled to lie within L n of a uniform random draw X n from the finite configu- 
ration x™. A simple argument using the Binomial distribution then shows that 
the total variation distance between Y n and UniformQO, y/n] 2 ) tends to zero; it 
follows that X n can be coupled to a UniformQO, y/n] 2 ) random variable Y n so 
that 

\X n — Y n 



E 



min 1, 



where the convergence takes place in probability. 



0. 



5.6 Poisson line process networks 

Remark [T] indicates that more can be said about the mean semi-perimeter 

I Epm(aC(«i, «,-))] , 

and this will be returned to in later work. For example, consider the network 
formed entirely from a Poisson line pattern. If the pattern is conditioned to 
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contain points Vi, Vj then the perimeter dC(vi,i>j) will be close to providing a 
genuine network geodesic. 

Note that questions about C(u,, Vj) bear a family resemblance to the D.G. Kendall 
conjecture about the asymptotic shape of large cells in a Poisson line pattern. 
However C(vi, Vj) is the result of a very explicit conditioning and hence explicit 
and rather complete answers can be obtained by direct methods, in contrast to 
the striking work on resolving the conjecture about large cells [6, 4. 3, 2]. 



5.7 An open question 

In the random points model we can pose a more precise question. Over choices 
of network G subject to the constraint 

E [len(G(x n )) - len(ST(x"))] = o(n) , 

or the constraint 

E[len(G(x")] = 0(n) , 

what is the minimum value of E [excess(G(x™))]? Our results pin down this 
minimum value, in the latter case to the range [f2 (yTogn) , O(logn)] and in the 
former case the range [CI (Vlog nj , o(w n logn)]. But it remains an open question 
what should be the exact order of magnitude. 
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